Automatic Determination of the Standard Chinese Prosodic Phrase Boundaries by F0 Generation Model
نویسندگان
چکیده
We proposed an automatic method for determining the boundaries of prosodic phrases in real speech waves. In this method, the dynamic programming ( DP ) and the least mean square error ( LMSE ) methods were implemented based on the F0 generation model. In order to evaluate the accuracy and validity of this proposed method, a set of 973 standard Chinese speech sentences was selected. The cumulative proportion of the estimated prosodic phrase boundaries approached 76% when ET (0i) was less than the average duration of the prosodic phrases. Thus, it can be concluded that this proposed method can be used in the practical application.
منابع مشابه
An F0 Contour Model in Chinese Based on Templates of Prosodic Words
The problem of F0 contour generation in Chinese are addressed in this paper. An F0 contour model based on templates of prosodic words is proposed. Taking templates of prosodic word F0 contour as the basic units, the basic structure of the model is established with references to the “small ripples on top of big waves theory” and “Fujisaki model”. A three-layer prosodic hierarchy which consists o...
متن کاملThe use of F0 reliability function for prosodic command analysis on F0 contour generation model
This paper describes a method of utilizing an “F0 Reliability Field” (FRF), which we have proposed in our previous work, for estimating prosodic commands on F0 contour generation model. This FRF is the time-frequency representation of F0 likelihood, and an advantage of FRF is that it is not necessary to consider F0 errors that occur during an automatic F0 determination. Therefore, it is thought...
متن کاملAutomatic prosodic segmentation by F0 clustering using superpositional modeling
In this paper, we propose an automatic method for detecting accent phrase boundaries in Japanese continuous speech by using F0 information. In the training phase, hand labeled accent patterns are parameterized according to a superpositional model proposed by Fujisaki, and assigned to some clusters by a clustering method, in which accent templates are calculated as centroid of each cluster. In t...
متن کاملCoPaSul Manual - Contour-based parametric and superpositional intonation stylization
The purposes of the CoPaSul toolkit are (1) automatic prosodic annotation and (2) prosodic feature extraction from syllable to utterance level. CoPaSul stands for contour-based, parametric, superpositional intonation stylization. In this framework intonation is represented as a superposition of global and local contours that are described parametrically in terms of polynomial coefficients. On t...
متن کاملAccent type and phrase boundary estimation using acoustic and language models for automatic prosodic labeling
This paper proposes an automatic prosodic labeling technique for constructing speech database used for speech synthesis. In the corpus-based Japanese speech synthesis, it is essential to use annotated speech data with prosodic information such as phrase boundaries and accent types. However, manual annotation is generally time-consuming and expensive. To overcome this problem, we propose an esti...
متن کامل